33 research outputs found

    Big Data Needs Big Governance: Best Practices From Brain-CODE, the Ontario-Brain Institute’s Neuroinformatics Platform

    Get PDF
    The Ontario Brain Institute (OBI) has begun to catalyze scientific discovery in the field of neuroscience through its large-scale informatics platform, known as Brain-CODE. The platform supports the capture, storage, federation, sharing, and analysis of different data types across several brain disorders. Underlying the platform is a robust and scalable data governance structure which allows for the flexibility to advance scientific understanding, while protecting the privacy of research participants. Recognizing the value of an open science approach to enabling discovery, the governance structure was designed not only to support collaborative research programs, but also to support open science by making all data open and accessible in the future. OBI’s rigorous approach to data sharing maintains the accessibility of research data for big discoveries without compromising privacy and security. Taking a Privacy by Design approach to both data sharing and development of the platform has allowed OBI to establish some best practices related to large-scale data sharing within Canada. The aim of this report is to highlight these best practices and develop a key open resource which may be referenced during the development of similar open science initiatives

    Standardization of electroencephalography for multi-site, multi-platform and multi-investigator studies: Insights from the canadian biomarker integration network in depression

    Get PDF
    Subsequent to global initiatives in mapping the human brain and investigations of neurobiological markers for brain disorders, the number of multi-site studies involving the collection and sharing of large volumes of brain data, including electroencephalography (EEG), has been increasing. Among the complexities of conducting multi-site studies and increasing the shelf life of biological data beyond the original study are timely standardization and documentation of relevant study parameters. We presentthe insights gained and guidelines established within the EEG working group of the Canadian Biomarker Integration Network in Depression (CAN-BIND). CAN-BIND is a multi-site, multi-investigator, and multiproject network supported by the Ontario Brain Institute with access to Brain-CODE, an informatics platform that hosts a multitude of biological data across a growing list of brain pathologies. We describe our approaches and insights on documenting and standardizing parameters across the study design, data collection, monitoring, analysis, integration, knowledge-translation, and data archiving phases of CAN-BIND projects. We introduce a custom-built EEG toolbox to track data preprocessing with open-access for the scientific community. We also evaluate the impact of variation in equipment setup on the accuracy of acquired data. Collectively, this work is intended to inspire establishing comprehensive and standardized guidelines for multi-site studies

    Designing and Implementing a Privacy Preserving Record Linkage Protocol

    Get PDF
    Introduction The Ontario Brain Institute has developed Brain-CODE, an informatics platform, to support the acquisition, storage, management and analysis of multi-modal data. The standardized research data within Brain-CODE spans several brain disorders, allowing for integrative analyses, while also providing the opportunity to leverage existing clinical administrative data holdings through external linkages. Objectives and Approach Within Ontario, the majority of individuals who access the healthcare system have a unique identifier, the Ontario Health Insurance Plan (OHIP) number. The OHIP number can facilitate linkages with administrative data holdings, such as those at the Institute for Clinical Evaluative Sciences (ICES). Given that OBI is not permitted under Ontario’s privacy legislation to hold OHIP numbers, identifiers for consented participants are encrypted using a public key mechanism upon entry into Brain-CODE, where the private key is inaccessible. To facilitate linkages involving OHIP numbers between Brain-CODE and ICES, Brain-CODE Link software was co-developed by members of the Indoc Consortium. Results Brain-CODE Link allows a deterministic linkage between encrypted identifiers (OHIP numbers), without revealing participant identity. The same homomorphic encryption algorithm applied to identifiers upon entry to Brain-CODE, is applied to relevant identifiers within ICES data holdings. Encrypted identifiers from Brain-CODE are securely transferred to ICES, where a comparison computation calculates differences between the encrypted sets. These differences are sent to a semi-trusted third party, who has no access to the original data, to decrypt the differences using the private key. A zero difference indicates a set of matching identifiers. One of the main challenges during testing and development of Brain-CODE Link was ensuring the software was capable of scaling to a population level, performing a large number of comparisons, in a computationally efficient manner. Conclusion/Implications Ongoing pilot projects within the areas of epilepsy, neurodevelopment disorders, and neurodegeneration will be the first examples of linkages between Brain-CODE and ICES. Brain-CODE Link has successfully performed several billion test comparisons, indicating its suitability to function as a scalable privacy preserving record linkage to support comprehensive analyses

    Large-scale mapping of human protein–protein interactions by mass spectrometry

    Get PDF
    Mapping protein–protein interactions is an invaluable tool for understanding protein function. Here, we report the first large-scale study of protein–protein interactions in human cells using a mass spectrometry-based approach. The study maps protein interactions for 338 bait proteins that were selected based on known or suspected disease and functional associations. Large-scale immunoprecipitation of Flag-tagged versions of these proteins followed by LC-ESI-MS/MS analysis resulted in the identification of 24 540 potential protein interactions. False positives and redundant hits were filtered out using empirical criteria and a calculated interaction confidence score, producing a data set of 6463 interactions between 2235 distinct proteins. This data set was further cross-validated using previously published and predicted human protein interactions. In-depth mining of the data set shows that it represents a valuable source of novel protein–protein interactions with relevance to human diseases. In addition, via our preliminary analysis, we report many novel protein interactions and pathway associations

    EMT transcription factors snail and slug directly contribute to cisplatin resistance in ovarian cancer

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The epithelial to mesenchymal transition (EMT) is a molecular process through which an epithelial cell undergoes transdifferentiation into a mesenchymal phenotype. The role of EMT in embryogenesis is well-characterized and increasing evidence suggests that elements of the transition may be important in other processes, including metastasis and drug resistance in various different cancers.</p> <p>Methods</p> <p>Agilent 4 × 44 K whole human genome arrays and selected reaction monitoring mass spectrometry were used to investigate mRNA and protein expression in A2780 cisplatin sensitive and resistant cell lines. Invasion and migration were assessed using Boyden chamber assays. Gene knockdown of <it>snail </it>and <it>slug </it>was done using targeted siRNA. Clinical relevance of the EMT pathway was assessed in a cohort of primary ovarian tumours using data from Affymetrix GeneChip Human Genome U133 plus 2.0 arrays.</p> <p>Results</p> <p>Morphological and phenotypic hallmarks of EMT were identified in the chemoresistant cells. Subsequent gene expression profiling revealed upregulation of EMT-related transcription factors including <it>snail, slug, twist2 </it>and <it>zeb2</it>. Proteomic analysis demonstrated up regulation of Snail and Slug as well as the mesenchymal marker Vimentin, and down regulation of E-cadherin, an epithelial marker. By reducing expression of <it>snail </it>and <it>slug</it>, the mesenchymal phenotype was largely reversed and cells were resensitized to cisplatin. Finally, gene expression data from primary tumours mirrored the finding that an EMT-like pathway is activated in resistant tumours relative to sensitive tumours, suggesting that the involvement of this transition may not be limited to <it>in vitro </it>drug effects.</p> <p>Conclusions</p> <p>This work strongly suggests that genes associated with EMT may play a significant role in cisplatin resistance in ovarian cancer, therefore potentially leading to the development of predictive biomarkers of drug response or novel therapeutic strategies for overcoming drug resistance.</p

    The CAMH Neuroinformatics Platform: A Hospital-Focused Brain-CODE Implementation

    Get PDF
    Investigations of mental illness have been enriched by the advent and maturation of neuroimaging technologies and the rapid pace and increased affordability of molecular sequencing techniques, however, the increased volume, variety and velocity of research data, presents a considerable technical and analytic challenge to curate, federate and interpret. Aggregation of high-dimensional datasets across brain disorders can increase sample sizes and may help identify underlying causes of brain dysfunction, however, additional barriers exist for effective data harmonization and integration for their combined use in research. To help realize the potential of multi-modal data integration for the study of mental illness, the Centre for Addiction and Mental Health (CAMH) constructed a centralized data capture, visualization and analytics environment—the CAMH Neuroinformatics Platform—based on the Ontario Brain Institute (OBI) Brain-CODE architecture, towards the curation of a standardized, consolidated psychiatric hospital-wide research dataset, directly coupled to high performance computing resources

    Proteomic Analyses Reveal High Expression of Decorin and Endoplasmin (HSP90B1) Are Associated with Breast Cancer Metastasis and Decreased Survival

    Get PDF
    BACKGROUND: Breast cancer is the most common malignancy among women worldwide in terms of incidence and mortality. About 10% of North American women will be diagnosed with breast cancer during their lifetime and 20% of those will die of the disease. Breast cancer is a heterogeneous disease and biomarkers able to correctly classify patients into prognostic groups are needed to better tailor treatment options and improve outcomes. One powerful method used for biomarker discovery is sample screening with mass spectrometry, as it allows direct comparison of protein expression between normal and pathological states. The purpose of this study was to use a systematic and objective method to identify biomarkers with possible prognostic value in breast cancer patients, particularly in identifying cases most likely to have lymph node metastasis and to validate their prognostic ability using breast cancer tissue microarrays. METHODS AND FINDINGS: Differential proteomic analyses were employed to identify candidate biomarkers in primary breast cancer patients. These analyses identified decorin (DCN) and endoplasmin (HSP90B1) which play important roles regulating the tumour microenvironment and in pathways related to tumorigenesis. This study indicates that high expression of Decorin is associated with lymph node metastasis (p&lt;0.001), higher number of positive lymph nodes (p&lt;0.0001) and worse overall survival (p = 0.01). High expression of HSP90B1 is associated with distant metastasis (p&lt;0.0001) and decreased overall survival (p&lt;0.0001) these patients also appear to benefit significantly from hormonal treatment. CONCLUSIONS: Using quantitative proteomic profiling of primary breast cancers, two new promising prognostic and predictive markers were found to identify patients with worse survival. In addition HSP90B1 appears to identify a group of patients with distant metastasis with otherwise good prognostic features

    Applying mass spectrometry based proteomic technology to advance the understanding of multiple myeloma

    No full text
    Abstract Multiple myeloma (MM) is the second most common hematological malignancy in adults. It is characterized by clonal proliferation of terminally differentiated B lymphocytes and over-production of monoclonal immunoglobulins. Recurrent genomic aberrations have been identified to contribute to the aggressiveness of this cancer. Despite a wealth of knowledge describing the molecular biology of MM as well as significant advances in therapeutics, this disease remains fatal. The identification of biomarkers, especially through the use of mass spectrometry, however, holds great promise to increasing our understanding of this disease. In particular, novel biomarkers will help in the diagnosis, prognosis and therapeutic stratification of MM. To date, results from mass spectrometry studies of MM have provided valuable information with regards to MM diagnosis and response to therapy. In addition, mass spectrometry was employed to study relevant signaling pathways activated in MM. This review will focus on how mass spectrometry has been applied to increase our understanding of MM

    Differential analysis of membrane proteins in mouse fore- and hindbrain using a label-free approach

    No full text
    The ability to quantitatively compare protein levels across different regions of the brain to identify disease mechanisms remains a fundamental research challenge. It requires both a robust method to efficiently isolate proteins from small amounts of tissue and a differential technique that provides a sensitive and comprehensive analysis of these proteins. Here, we describe a proteomic approach for the quantitative mapping of membrane proteins between mouse fore- and hindbrain regions. The approach focuses primarily on a recently developed method for the fractionation of membranes and on-membrane protein digestion, but incorporates off-line SCX-fractionation of the peptide mixture and nano-LC-MS/MS analysis using an LTQ-FT-ICR instrument as part of the analytical method. Comparison of mass spectral peak intensities between samples, mapping of peaks to peptides and protein sequences, and statistical analysis were performed using in-house differential analysis software (DAS). In total, 1213 proteins were identified and 967 were quantified; 81% of the identified proteins were known membrane proteins and 38% of the protein sequences were predicted to contain transmembrane helices. Although this paper focuses primarily on characterizing the efficiency of this purification method from a typical sample set, for many of the quantified proteins such as glutamate receptors, GABA receptors, calcium channel subunits, and ATPases, the observed ratios of protein abundance were in good agreement with the known mRNA expression levels and/or intensities of immunostaining in rostral and caudal regions of murine brain. This suggests that the approach would be well-suited for incorporation in more rigorous, larger scale quantitative analysis designed to achieve biological significance
    corecore